CDS

Accession Number TCMCG017C33162
gbkey CDS
Protein Id OMO55478.1
Location join(3191..3404,4849..4977,5239..5369,5458..5508,6432..6548,6642..6684,6770..6822,7365..7484,8087..8149,8241..8319,8927..9036,9134..9187,9279..9346,9442..9544,9627..9829,9913..10009,11257..11328,11449..11534,11609..11725,11807..11891,12000..12167,12397..12507,12599..12685,12781..12908,12990..13072,13168..13273,13393..13539,14100..14220,14377..14490)
GeneID InterPro:IPR006597
Organism Corchorus olitorius
locus_tag COLO4_35966

Protein

Length 1019aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA215141, BioSample:SAMN03160584
db_source AWUE01022943.1
Definition Sel1-like protein [Corchorus olitorius]
Locus_tag COLO4_35966

EGGNOG-MAPPER Annotation

COG_category GOT
Description UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SEC
KEGG_TC -
KEGG_Module -
KEGG_Reaction R09304        [VIEW IN KEGG]
R09676        [VIEW IN KEGG]
KEGG_rclass RC00005        [VIEW IN KEGG]
RC00059        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01003        [VIEW IN KEGG]
ko03036        [VIEW IN KEGG]
KEGG_ko ko:K09667        [VIEW IN KEGG]
EC 2.4.1.255        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00514        [VIEW IN KEGG]
ko04931        [VIEW IN KEGG]
map00514        [VIEW IN KEGG]
map04931        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGCTCTCCTTGCAGAGCGATCCTCGGCTGCAACAGTACCATCCTAGCCAGCTGCTTCAACAACAGCAGCAGCAGCAACAACAACAACAACAAGAAGTTCAATTGCTTCCATACAATGATGACTCACTGAGTCTGCACTCAGATTTTGGCGGTGCTATTGCGGCCGCTGCTGCTTCGTCCTCCTCGGCTTTGCCAAATCATAAGCTCTCCCAGGTTGATGATGACACACTCATGGCCCTTGCTCATCAAAAGTACAAGGCTGGTAACTACAAGCAAGCATTAGAACATAGCAGCGCCATCTATGAGAGGAACCCTCGTCGTACTGATAATCTTCTTCTCCTAGTTCTAAATGCCTCTGACATTTTTTCCCCTCAGTTGCATAATTATGATCAATGCATTGCAAAGAATGAAGAAGCCCTCAGAATTGATCCACAATTTGCAGAGTGCTATGGAAATATGGCAAATGCTTGGAAGGAGAAAGGAAATATTGATGCTGCAATCCGGTATTATTTGTATGCTATTGAGCTTCGGCCAAATTTTTCCGATGCATGGTCAAATCTAGCTAGTGCATACATGCGGAAAGGGAGGCTTAATGAGGCAGCTCAATGTTGCCGCCAAGCTCTTGCATTAAATCCCCGTTTGGTTGATGCTCACAGTAACCTTGGGAACTTAATGAAAATTCAAGTGTCTACCTATATTGCATATCTTGGTTGTTGTTCTTCAGGGTTGGCTGTCTGGGCTTACAATTGCTACCTTGAGGCTCTTCGTATACAACCTAATTTCGCAATTGCATGGTCAAATCTTGCTGGGCTTTTCATGGAGGCTGGGGATCTTAACAGGGCACTTCAATACTATAAGGAAGCAGTGAGGCTGAAACCGACGTTTTTTGATGCCTACTTAAACCTTGGAAATGTGTATAAGGCTCTGGGAATGCCTCAAGAGGCTATTGTATGCTATCAGCGTGCTCTTCAAGTGCGACCGGATTACGCTATGGCCTATGGCAATTTGGCTAGTATCTATTACGAACAACGTAACTTGGATTTGGCGATTCTCAATTATAGGAGAGCAATTGCTTTTGACTCAGGATTCTTGGAGGCATATAACAATTTGGGTAATGCTTTGAAGGATGCCGGAAGAGTTGATGAAGCAATGCAATGTTATCGGCAATGTCTTGCCCTTCAACCTAACCATCCTCAGGCACTTACGAATCTTGGGAATATATATATGGAATGGAATATGTTGAGTGCTGCTGCTTCATGCTACAAGGCAACTTTATCTGTAACAACAGGACTTTCTGCTCCCTTCAACAATTTAGCAATCATTTACAAACAGCAGGGTAATCTTTCAGATGCTATATCTTGTTACAATGAAGTTCTGCGCATTGATCCTACGGCAGCTGATGCACTTGTCAACCGAGGGAATACATATAAGGAGAGTGGAAGAGTAAATGAAGCTATTCAAGACTACATACGAGCTATTAACATTCGGCCATCGATGGCTGAAGCTCATACAAATTTGGCTTCAGCTTACAAGGACAGTGGACATGTTGAGGCTGCAATAAAAAGCTATAAGCAAGCACTGGTTCTTCGCCCTGATTTTCCAGAAGCAACCTGTAACCTTCTACACACATTACAGTGTGTTTGTGACTGGGAGGATCGAGAGAATAAATTTGTTGAAGTTGAAGGCATACTCAGGAGACAGATTAAGATGTCAGTCATTCCTAGCGTGCAACCTTTCCATGCAATAGCATATCCAATTGATCCGATTCTTGCGCTGGAAATCAGGCAAGTTGATCGTAAATATGCGGCACACTGCTCTGTTATTGCATCCCGTTATTCACTTCCTCCTTTCAACTATCCTGTGCGCCTCCCTGTGAAGAGTGATGGTGGGAGTGGACGCCTAAGAGTGGGATATGTGAGTAGTGATTTTGGTAACCATCCCCTTTCTCATCTTATGGGCTCAGTCTTTGGCATGCACAACAGAGAACACGTTGAGGTGTTCTGCTATGCATTGAGTCCAAATGATGGGACAGAATGGAGGTTGCGTATCCAGGCAGAAGTGGAGCACTTCATTGATGTATCATCCATGTCCTCTGACATGATTGCGAAGATGATAAATGAAGATAACATACAAATTCTTGTCAATCTCAATGGTTATACCAAGGGGGCAAGGAATGAAATATTTGCCATGCAACCTGCTCCTATTCAGATTTCTTACATGGGATTTCCCGGGACTACTGGTGCATCATACATACATTATTTGGTAACTGATGAGTTTGTCTCACCTCTTCGTTTCTCTCACATCTACTCTGAGAAGCTTGTTCATCTTCCTCATTGTTACTTTGTAAATGATTATAAGCAGAAAAACCGTGATGTCTTGGATCCAAACTGCTTGCCTAAGAGATCTGATTATGGACTACCTGAAGACAAATTTATCTTTGCATGTTTCAATCAGCTGTACAAGACGGATCCCGACATTTTCAACACATGGTGCAATATTCTTAAGCGTGTTCCAAATAGTGCTCTTTGGCTTCTTAGATTCCCAGCTGCAGGCGAGATGAGACTTCGTGCATATGCAACTCAACAGGGTGTGCGGTCAGATCAAATTATTTTTACAGATGTTGCCATGAAAAGTGAACATATAAGACGAAGTGCCTTGGCAGATCTCTTCCTTGATACACCTTTATGCAATGCACACACAACAGGCACCGATGTTCTATGGGCTGGGCTTCCCATGGTGACCCTTCCGCTTGAAAAGATGGCGACTAGAGTTGCTGGTTCATTGTGTCTGGCTACTGGTGTTGGGGAGGAGATGATTGTTAGCAGTTTGAAAGAATATGAAGAGAAGGCTGTCTCACTTGCTCTAAATCGCCCAAAGCTCCAAGATCTTTCTAATAAACTCAAGGAAGCCCGTATGACCTGCCCCCTTTTTGACACTATACGTTGGGTAAGGAACCTTGAACGAGCATATTATAAGATGTGGAATCTGCACTGCTCAGGTCAACAACCGCAACCCTTCAAAGTAACAGAGAATGATCAAGAATTTCCTTACGATAGATAG
Protein:  
MLSLQSDPRLQQYHPSQLLQQQQQQQQQQQQEVQLLPYNDDSLSLHSDFGGAIAAAAASSSSALPNHKLSQVDDDTLMALAHQKYKAGNYKQALEHSSAIYERNPRRTDNLLLLVLNASDIFSPQLHNYDQCIAKNEEALRIDPQFAECYGNMANAWKEKGNIDAAIRYYLYAIELRPNFSDAWSNLASAYMRKGRLNEAAQCCRQALALNPRLVDAHSNLGNLMKIQVSTYIAYLGCCSSGLAVWAYNCYLEALRIQPNFAIAWSNLAGLFMEAGDLNRALQYYKEAVRLKPTFFDAYLNLGNVYKALGMPQEAIVCYQRALQVRPDYAMAYGNLASIYYEQRNLDLAILNYRRAIAFDSGFLEAYNNLGNALKDAGRVDEAMQCYRQCLALQPNHPQALTNLGNIYMEWNMLSAAASCYKATLSVTTGLSAPFNNLAIIYKQQGNLSDAISCYNEVLRIDPTAADALVNRGNTYKESGRVNEAIQDYIRAINIRPSMAEAHTNLASAYKDSGHVEAAIKSYKQALVLRPDFPEATCNLLHTLQCVCDWEDRENKFVEVEGILRRQIKMSVIPSVQPFHAIAYPIDPILALEIRQVDRKYAAHCSVIASRYSLPPFNYPVRLPVKSDGGSGRLRVGYVSSDFGNHPLSHLMGSVFGMHNREHVEVFCYALSPNDGTEWRLRIQAEVEHFIDVSSMSSDMIAKMINEDNIQILVNLNGYTKGARNEIFAMQPAPIQISYMGFPGTTGASYIHYLVTDEFVSPLRFSHIYSEKLVHLPHCYFVNDYKQKNRDVLDPNCLPKRSDYGLPEDKFIFACFNQLYKTDPDIFNTWCNILKRVPNSALWLLRFPAAGEMRLRAYATQQGVRSDQIIFTDVAMKSEHIRRSALADLFLDTPLCNAHTTGTDVLWAGLPMVTLPLEKMATRVAGSLCLATGVGEEMIVSSLKEYEEKAVSLALNRPKLQDLSNKLKEARMTCPLFDTIRWVRNLERAYYKMWNLHCSGQQPQPFKVTENDQEFPYDR